Mining and Classification of Multivariate Sequential Data

نویسندگان

  • Ariella D. Richardson
  • Noa Agmon
  • Tammar Shrot
چکیده

Multivariate sequence mining and classification are important and challenging tasks. They can be applied to numerous domains including medical diagnosis, handwriting deficiency diagnosis, identification of users for security or personalized TV services, and even transportation and traffic planning. The problem we address in this dissertation is classification of multivariate sequences. Multivariate sequences are sequences that have multiple attributes for each item in the sequence. Several attempts to address this problem exist, but none provide a full solution. One type of solution to this problem is to reduce the solution to a single attribute or non sequential problem while loosing valuable information. Other solutions address both the multivariate and the sequential aspect of the input but provide an unscalable solution. In this dissertation we first present COACH (Cumulative Online Algorithm for Classification of Handwriting deficiencies). COACH is a classification algorithm for multivariate sequences that uses heuristics to combine several single attribute classifications. COACH is evaluated on real data obtained from children with poor handwriting using a digitizer tablet. Results show that COACH manages to successfully differentiate between poor to proficient handwriting. Integrating several single attribute classifications encouraged us to search for a solution that uses all the attributes together in the classification process. The second part of the dissertation introduces frequent sequence mining. Frequent sequence mining, as well as being a challenging and interesting task, can be used for

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessment of an ore body internal dilution based on multivariate geostatistical simulation using exploratory drill hole data

Dilution can best be defined as the proportion of waste tonnage to the total weight of ore and waste in each block. Predicting the internal dilution based on geological boundaries of waste and ore in each block can help engineers to develop more reliable long-term planning designs in mining activities. This paper presents a method to calculate the geological internal dilution in each block and ...

متن کامل

Heavy metal pollution and identification of their sources in soil over Sangan iron-mining region, NE Iran

The aim of this study was to determine the extent of metal pollutions and the identification of their major sources in the vicinity of the Sangan iron mine occurring in NE Iran. Soil samples were collected from the vicinity of the mine site and analyzed for heavy metals. In addition, the chemical speciation of these metals was investigated by means of the sequential extraction procedure. The st...

متن کامل

Characterization of rare earth elements by coupling multivariate analysis, factor analysis, and geostatistical simulation; case-study of Gazestan deposit, central Iran

The traditional approaches of modeling and estimation of highly skewed deposits have led to incorrect evaluations, creating challenges and risks in resource management. The low concentration of the rare earth element (REE) deposits, on one hand, and their strategic importance, on the other, enhances the necessity of multivariate modeling of these deposits. The wide variations of the grades and ...

متن کامل

A New GIS based Application of Sequential Technique to Prospect Karstic Groundwater using Remotely Sensed and Geoelectrical Methods in Karstified Tepal Area, Shahrood, Iran

In this research, recognition of karstic water-bearing zones using the management of exploration data in Kal-Qorno valley, situated in the Tepal area of Shahrood, has been considered. For this purpose, the sequential exploration method was conducted using geological evidences and applying remote sensing and geoelectrical resistivity methods in two major phases including the regional and local s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011